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Raw Scqacncc Listing Error Summaiy 



lo/02 3.'&3| 



0 



ERROR DETECTED SUGGESTED CORRECTION ' SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH ''ALPHA'* HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

I Wrapped Nucleics The numberAext at ihe end of each line "wrapped^ down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent ''wrapping,** 

2 I nvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 Misaligned Amino The numbering under each 5"* amino acid is misaligned. Do not use tab codes between numbers; 

Numbering use space characters, instead. 

4 ^Non- ASCII The submitted file was not saved in ASCn(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission b saved In ASCII text 

5 Variable Length Sequences) ^contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

6 Patentin 2.0 A "bug** in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

•*bug** sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to . 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences.' 

7 Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X** is shown) 
(t) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X**^ is shoWn) 
This sequence is intentionally skipped 

Please also adjust the "(") NUMBER OF SEQUENCES:** response to include the skipped sequences. 



8 Skipped Sequences Sequence(s) _ 



(NEW RULES) 



_Use of n*s or Xaa's 
(NEW RULES) 



^ missing. If Intentional, please insert the following lines for each skipped sequence. 



10 



II 



/ 



Invalid <213> 
Response 

Useof<220> 



12 Patentin 2.0 * 



<2I0> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa*s have been detected in the Sequence Listing. 
. Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa*s are present 
In <220> to <223> section, please explain location of n or Xaa; and which residue n or Xaa represents. 

Per 1.823 of Sequence l&les^ the only valid <213> responses are: Unknown, Artiiicial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial-Sequence 

Sequence(s) missing the <220> "Feature** and associated numeric identifiers and responses. 

Useof <220> to <223> is MANDATORY if <213> **Organism** response is "Artificial Sequence** or 

"Unknown.** Please explain source of genetic materia] in <220> to <223> sectioa 

(See "Federal Register,**06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk** function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
. listing). Instead, please use "File Manager^ or any other manual means to copy file to floppy disk. 



13 ^Misuse of n n can only be used to represent a single nodcotlde in a nucleic acid sequence. N is not used to rqiresent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 



RAW SEQUENCE LISTING 
ERROR REPORT 



; BIOTECHNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIQ detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

' Date Processed by STIC: 



\0 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPUCANT BY EriHER: , 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPUCANT, Wrra A NOTICE TO COMPLY or, 

2) .^TELEPHONING APPUCANT AND FAXING A COPY OF THIS PRiNTOUT* WITH A ;> « . 

NOTICE t6 COMPLY ■ ' • ■•'.7. v-,, ■• 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703«30H212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-30fr4216. 
PATENTIN 2.1 e-mail help; patin2 1 help@nspto.gOv or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 





TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
> VERSION 3,1 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND.^ . v ^ 
' TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: Y^M ^ ^ 
http;//www,uspto,gov/wcb/office5/pac/chccker 

'^^^AppiUM^ sequence tt^fonnadon elect^^ diskette or CD-R6ni shchiMj^ 

a possibaity iihat die dkk/CD-Rom may have been affected by treatment given to all incoming ji^.^^^^^' '^ - - 7|| 
I^JPlease ^^^ alternate methods of submission for the disk/CIMlom or replacemcaitdi^pE^Lp^ ^ 



Any ieplv including a sequence listing in electronic fomi should NOT be sent to flie 20231 iiy^eaddless fbip the 



United States Patent airf Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.ospto.gov/ebc/efs/dovmloads/documentsAtm> , EES Submission 
^ User Manual -ePAVE) ■ ^. 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directty to: 

U.S. Patenrand Itedemaik Office, Technology Center 1600, Reception Area, 7^ Floor, Examiner Name, 
Sequence Ihfomiation, Otystal NfaU One, 1911 SouA Oait Str^ 
Or 

U.S. Patent and Tiademaik Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or thcr delivery service to: U.S. Patent and Trademark Office, 
B x Sequeifce, Room 1B03-Mailroom, Ciystal Plaza Two, 201 1 South Claik Place, Arlington, VA. 22202 

Revised 01/29/2002 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/023,831 



OIPE 



Doss Not Comi 
CoTOcted Diskette 

DATE: 04/16/2002 
TIME: 14:35:38 



3 
4 
5 
6 
8 

HELICAL 
9 
11 
13 
14 
16 
17 
19 
20 
22 
23 
25 
26 
28 
30 
32 
33 
34 
35 
37 
38 
41 
42 
43 
44 
46 
47 
50 
51 
52 
53 
55 
56 
59 
60 
61 
62 
64 
65 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J023831.raw 

<110> APPLICANT: VAUGHAN, Paul R. 

GALANIS, Maria 

RAMSHAW, John A.M. 

WERKMEISTER, Jerome A. 
<120> TITLE OF INVENTION: METHOD FOR PRODUCING, IN YEAST, A HYDROXYLATED TRIPLE 
PROTEIN, 

AND YEAST HOST CELLS USEFUL IN SAID METHOD 
<130> FILE REFERENCE: Q67867 

<140> CURRENT APPLICATION NUMBER: 10/023,831 

<141> CURRENT FILING DATE: 2001-12-21 

<150> PRIOR APPLICATION NUMBER: US 09/297,269 



1999-04-28 
NUMBER: PO3310 

1996-10-29 
NUMBER: PO4306 

1996- 12-19 
NUMBER: PCT/AU97/00721 

1997- 10-29 



[:(Si 

:: r 



Synthetic Construct 



<151> PRIOR FILING DATE 
<150> PRIOR APPLICATION 
<151> PRIOR FILING DATE 
<150> PRIOR APPLICATION 
<151> PRIOR FILING DATE 
<150> PRIOR APPLICATION 
<151> PRIOR FILING DATE 
<160> NUMBER OF SEQ ID NOS : 52 
<170> SOFTWARE: Patentin version 3.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 26 
<212> TYPE: DNA 
<213> ORGANISM 
<400> SEQUENCE 
ctgtagagga tccccgggta cggagc 
<210> SEQ ID NO: 2 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Synthetic Construct 

<400> SEQUENCE: 2 

ttatattgaa ttctcaaaaa ttc 

<210> SEQ ID NO: 3 

<211> LENGTH: 32 

<212> TYPE: DNA 

<213> ORGANISM: Synthetic Construct 
<400> SEQUENCE: 3 

tgtaaaatta aaggatccca aagatgtggt at 
<210> SEQ ID NO: 4 
<211> LENGTH: 30 
<212> TYPE: DNA 

<213> ORGANISM: Synthetic Construct 

<400> SEQUENCE: 4 

gccgggatcc tgtcattcca atgacaacgt 



TdQ typQ of errors shown Gxtet thFQUghout 
KlQ S^u^ncd Listing. Please check @u&csqolS 
SSquenoes for sndsr enors. 



26 



23 



32 



30 



file://C:\Crf3\Outhol(^VsrJ02383 Lhtm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/IO/023 , 831 



DATE: 04/16/2002 
TIME: 14:35:38 



Input Set : A:\EP.txt 

Output Set: Nj\CRF3\04162002\J023831 .raw 



68 <210> SEQ ID NO: 5 

69 <211> LENGTH: 44 

70 <212> TYPE: DNA 

71 <213> ORGANISM: Synthetic Construct 

73 <400> SEQUENCE: 5 

74 actggacgga tcccgagcgc cccgcctgct ccgtgtccga catg 44 

77 <210> SEQ ID NO: 6 

78 <211> LENGTH: 24 

79 <212> TYPE: DNA 

80 <213> ORGANISM: Synthetic Construct 

82 <4 00> SEQUENCE: 6 

83 ggttctcctt ggtgacctcc cctt 24 

86 <210> SEQ ID NO: 7 

87 <211> LENGTH: 24 

88 <212> TYPE: DNA 

89 <213> ORGANISM: Synthetic Construct 

91 <400> SEQUENCE: 7 

92 gaaggggagg tcaccaagga gaac 24 

95 <210> SEQ ID NO: 8 

96 <211> LENGTH: 36 

97 <212> TYPE: DNA 

98 <213> ORGANISM: Synthetic Construct 

100 <400> SEQUENCE: 8 

101 ccttcaggat cctattagac ttcatctttc aacagc 36 

104 <210> SEQ ID NO: 9 

105 <211> LENGTH: 44 

106 <212> TYPE: DNA 

107 <213> ORGANISM: Synthetic Construct 

109 <400> SEQUENCE: 9 

110 actggacgga tcccgagcgc cccgcctgct ccgtctccga catg 44 

113 <210> SEQ ID NO: 10 

114 <211> LENGTH: 35 

115 <212> TYPE: DNA 

116 <213> ORGANISM: Synthetic Construct 

118 <400> SEQUENCE: 10 

119 ccttcaggat cctattagac ttcatctttc acagc 35 

122 <210> SEQ ID NO: 11 

123 <211> LENGTH: 39 

124 <212> TYPE: DNA 

125 <213> ORGANISM: Synthetic Construct 

127 <400> SEQUENCE: 11 

128 ctagttgaat tctacacaat gctgcgccgc gctctgctg 39 

131 <210> SEQ ID NO: 12 

132 <211> LENGTH: 36 

133 <212> TYPE: DNA 

134 <213> ORGANISM: Synthetic Construct 

136 <400> SEQUENCE: 12 

137 gcaatggaat tcttattaca gttcgtgcac agcttt 36 
140 <210> SEQ ID NO: 13 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/023,831 



DATE: 04/16/2002 
TIME: 14:35:38 




Input Set : A:\EP.tzt 

Output Set: N:\CRF3\04162002\J023831.raw 

141 <211> LENGTH: 4 

142 <212> TYPE: PRT 

143 <213> ORGANISM 
145 <400> SEQUENCE^ 

147 His Asp Glu Leu 

148 1 

151 <210> SEQ ID NO 

152 <211> LENGTH: 5 

153 <212> TYPE: PRT 

154 <213> ORGANISM: 
156 <400> SEQUENCE: 

158 Lys Asp Ala Glu Leu 

159 1 5 

162 <210> SEQ ID NO: 15 

163 <211> LENGTH: 42 

164 <212> TYPE: DNA 

165 <213> ORGANISM: Synthetic Construct 

167 <400> SEQUENCE: 15 

168 gctagcaagc ttggagctcc aggcccactt gggattgctg gg 

171 <210> SEQ ID NO: 16 

172 <211> LENGTH: 45 

173 <212> TYPE: DNA 

174 <213> ORGANISM: Synthetic Construct 

176 <400> SEQUENCE: 16 

177 tcgcgatcta gattataaaa agcaaacagg gccaacgtcc acacc 

180 <210> SEQ ID NO: 17 

181 <211> LENGTH: 6 

182 <212> TYPE: DNA 

183 <213> ORGANISM: 

185 <400> SEQUENCE: 

186 gctagc 

189 <210> SEQ ID NO: 

190 <211> LENGTH: 6 

191 <212> TYPE: DNA 

192 <213> ORGANISM: 

194 <400> SEQUENCE: 

195 aagctt 

198 <210> SEQ ID NO: 

199 <211> LENGTH: 6 

200 <212> TYPE: DNA 

201 <213> ORGANISM: 
203 <400> SEQUENCE: 

tctaga 

<210> SEQ ID NO: 
<211> LENGTH: 6 

209 <212> TYPE: DNA 

210 <213> ORGANISM: 

212 <400> SEQUENCE: 

213 tcgcga 



42 



45 



204 
207 
208 



Synthetic Construct 
17 

18 



Synthetic Construct 
18 



19 



Synthetic Construct 
19 

20 



Synthetic Construct 
20 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/023,831 



DATE: 04/16/2002 
TIME: 14:35:38 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J023831.raw 



216 <210> SEQ ID NO: 21 

217 <211> LENGTH: 21 

218 <212> TYPE: PRT 

219 <213> ORGANISM: Synthetic Construct 
221 <400> SEQUENCE: 21 

223 Asp Tyr Lys Asp Asp Asp Asp Lys Ala Ser Lys Leu Gly Ala Pro Gly 

224 15 10 15 

227 Pro Leu Gly lie Ala 

228 20 

231 <210> SEQ ID NO: 22 

232 <211> LENGTH: 9 

233 <212> TYPE: PRT 

234 <213> ORGANISM: Synthetic Construct 
236 <400> SEQUENCE: 22 

238 Gly Pro Pro Gly Pro Pro Gly Leu Ala 

239 1 5 
242 <210> SEQ ID NO: 23 
24 3 <211> LENGTH: 9 

244 <212> TYPE: PRT 

245 <213> ORGANISM: Synthetic Construct 
247 <400> SEQUENCE: 23 

249 Gly Pro Pro Gly Pro Pro Gly Glu Arg 

250 1 5 

253 <210> SEQ ID NO: 24 

254 <211> LENGTH: 9 

255 <212> TYPE: PRT 

256 <213> ORGANISM: Synthetic Construct 
258 <400> SEQUENCE: 24 

260 Gly Pro Pro Gly Pro Pro Gly Pro Ala 

261 1 5 

264 <210> SEQ ID NO: 25 

265 <211> LENGTH: 9 

266 <212> TYPE: PRT 

267 <213> ORGANISM: Synthetic Construct 
269 <400> SEQUENCE: 25 

271 Gly Pro Pro Gly Pro Pro Gly Ala Pro 

272 1 5 

275 <210> SEQ ID NO: 26 

276 <211> LENGTH: 9 

277 <212> TYPE: PRT 

278 <213> ORGANISM: Synthetic Construct 
280 <400> SEQUENCE: 26 

282 Gly Leu Ala Gly Ala Pro Gly Leu Arg 

283 1 5 

286 <210> SEQ ID NO: 27 

287 <211> LENGTH: 45 

288 <212> TYPE: PRT 

289 <213> ORGANISM: Synthetic Construct 
291 <400> SEQUENCE: 27 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/IO/023 , 831 



DATE: 04/16/2002 
TIME: 14:35:38 



Input Set : 
Output Set: 



A:\EP.txt 

Nj\CRP3\04162002\J023831.raw 



293 Gly Gly Lys 

294 1 



Gly Asp Ala 
5 



Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly 



10 15 



297 Leu Ala Gly 
298 



Ala Pro Gly 
20 



Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro 



25 30 



301 Glu Gly Gly 

302 35 



Lys Gly Ala 



Ala Gly Pro Pro Gly Pro Pro 



40 45 



305 <210> SEQ ID NO: 28 

306 <211> LENGTH: 7 

307 <212> TYPE: PRT 

308 <213> ORGANISM: Synthetic Construct 
310 <400> SEQUENCE: 28 

312 Met Gly Ala Pro Gly Ala Pro 

313 1 5 

316 <210> SEQ ID NO: 29 

317 <211> LENGTH: 6 

318 <212> TYPE: PRT 

319 <213> ORGANISM: Synthetic Construct 
321 <400> SEQUENCE: 29 

323 Gly Ala Pro Gly Ala Pro 

324 1 5 

327 <210> SEQ ID NO: 30 

328 <211> LENGTH: 7 

329 <212> TYPE: PRT 

330 <213> ORGANISM: Synthetic Construct 
332 <4 00> SEQUENCE: 30 

334 Gly Leu Glu Gly Pro Arg Gly 

335 1 5 

338 <210> SEQ ID NO: 31 

339 <211> LENGTH: 26 

340 <212> TYPE: DNA 

341 <213> ORGANISM: Synthetic Construct 

343 <400> SEQUENCE: 31 

344 aattccatgg gtgctccagg tgctcc 26 

347 <210> SEQ ID NO: 32 

348 <211> LENGTH: 22 

349 <212> TYPE: DNA 

350 <213> ORGANISM: Synthetic Construct 

352 <400> SEQUENCE: 32 

353 ggccacctgg tggacctggt gg 22 

356 <210> SEQ ID NO: 33 

357 <211> LENGTH: 22 

358 <212> TYPE: DNA 

359 <213> ORGANISM: Synthetic Construct 

361 <400> SEQUENCE: 33 

362 ggcccggtgg taagggtgac gc 22 

365 <210> SEQ ID NO: 34 

366 <211> LENGTH: 20 

367 <212> TYPE: DNA 

368 <213> ORGANISM: Synthetic Construct 
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VERIFICATION SUMMARY DATE: 04/16/2002 

PATENT APPLICATION: US/10/023,831 TIME: 14:35:39 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04162002\J023831, raw 
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